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ABSTRACT Very large combinatorial libraries of small 
molecules on solid supports can now be synthesized and each 
library element can be identified after synthesis by using 
chemical tags. These tag-encoded libraries are potentially 
useful in drug discovery, and, to test this utility directly, we 
have targeted carbonic anhydrase (carbonate dehydratase; 
carbonate hydro-lyase, EC 4.2.1.1) as a model. Two libraries 
consisting of a total of 7870 members were synthesized, and 
structure-activity relationships based on the structures pre- 
dicted by the tags were derived. Subsequently, an active 
representative of each library was resynthesized {2-[N-(4- 
sulfamoylbenzoyI)-4'-aminocyclohexanespiro]-4-oxo-7-hy- 
droxy-23-dihydrobenzopyran and [JV-(4-sulfamoylbenzoyl)-L- 
leucyl]piperidine-3-carboxylic acid} and these compounds 
were shown to have nanomolar dissociation constants (15 and 
4 nM, respectively). In addition, a focused sublibrary of 217 
sulfa moylbenzamides was synthesized and revealed a clear, 

'able structure-activity relationship describing isozyme- 

ictive carbonic anhydrase inhibitors. 

The current national focus on health care reform has high- 
lighted the continuing importance of developing more cost- 
effective methods for treating disease. Historically, the phar- 
maceutical industry has contributed to this effort by discov- 
ering and developing therapeutic molecules. The development 
of drug candidates is expensive (1, 2), and preclinical research 
toward the discovery and development of lead structures 
contributes significantly to this expense. For example, in" 1991, 
the costs of medicinal chemistry and biological testing prior to 
safety assessment amounted to nearly 30% of domestic re- 
search costs (3). Apart from the economics of drug discovery, 
classical medicinal chemistry cannot efficiently address the 
plethora of new biochemical targets suggested by recent 
discoveries in molecular genetics, and lead structures from 
natural sources are often too complex for the cost-effective 
synthesis of analogs. Hence, targets for therapeutic interven- 
tion cannot be rapidly and efficiently exploited. 

Combinatorial chemical synthesis applied to drug discovery 
promises to improve the productivity of medicinal chemistry 
both by significantly increasing the number of molecules 
available for testing and by providing facile routes toward 
synthetic analogs of active molecules. Many strategies for the 
generation of chemical diversity have been proposed (4-7). 
With a single exception (4), these approaches have been 
confined to~the synthesis of flexible oligomeric ligands of 
relatively high molecular weight (e.g., peptides and oligonu- 
cleotides) which tend to be poor therapeutic agents, in part 

ecause of their lack of availability and stability in vivo. 
For drug discovery, small-molecule libraries containing 
diverse functionality in a variety of molecular scaffolds possess 
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the greatest utility. To construct such structurally diverse 
libraries while allowing the assignment of a structure to each 
member, we use a set of chemically stable molecular tags 
during solid-phase synthesis on polymeric beads (8). These 
tags can be used to unambiguously encode each bead with the 
synthetic scheme for its library member and are sufficiently 
robust to allow a wide range of chemical reactions during 
ligand construction. 

A drug discovery effort based on such combinatorial librar- 
ies must pass several tests to be successful. First, the library 
must be diverse enough to permit the identification of a small 
subset of active molecules among many less active or inactive 
ones. Second, the most promising candidates in the library 
should suggest a testable structure-activity relationship. Third, 
since selectivity for a particular target is often important, 
strategies for the analysis of libraries containing a significant 
fraction of active compounds should result in optimization of 
selectivity as well as activity. 

To explore the capabilities of encoded combinatorial librar- 
ies in a small-molecule drug discovery effort directed toward 
lead identification and optimization, we chose carbonic anhy- 
drase (carbonate dehydratase; carbonate hydro-lyase, EC 
4.2.1.1) as a therapeutically significant model, since inhibitors 
of carbonic anhydrase are known to be useful in ameliorating 
the symptoms of glaucoma. The inhibition of carbonic anhy- 
drase by compounds containing primary sulfamoyl groups 
(-S0 2 NH 2 ) has been well-characterized, both pharmacologi- 
cally (9— 11) and structurally (12-15). This allowed the design 
of test libraries that were predisposed toward showing carbonic 
anhydrase inhibition without explicitly including known inhib- 
itors. Finally, since a number of different forms of the enzyme 
have been characterized, we were able to perform experiments 
to probe isoform selectivity. 

MATERIALS AND METHODS 

Materials. TentaGel S-NH 2 resin (particle size, 130 jim) was 
obtained from Rapp Polymere (Tubingen, Germany). The 
photocleavable linker was synthesized as described (16). Car- 
bonic anhydrase was obtained from Sigma. White 96-well 
polystyrene plates were obtained from Dynatech. 

Library Synthesis and Analysis. Libraries were synthesized 
by the split synthesis protocol, and each bead thus contains one 
member of the library (17-19). All members were affixed via 
a photocleavable o-nitrobenzyl linker. To preserve the syn- 
thetic history of each bead, each library was indexed in a binary 
fashion using electrophone tags. These lags were attached by 
using [(CF 3 C02)2Rh]2-activated carbene insertion as de- 
scribed (20). After identification and isolation of beads, the 
tags were analyzed by gas chromatography (8). 

Abbreviations: DNSA, dansylamide; hCA(I) and hCA(II), human 
carbonic anhydrase isozymes I and II. 
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Synthesis of Dihydrobenzopyran Library. First, three dihy- 
droxyacetophenones (2-acetyl-l,4-dihydroxybenzene, 2-ace- 
tyl-l,5-dihydroxybenzene, and 2-acetyl-6-methyl-l,5-dihy- 
droxybenzene) were coupled using Mitsunobu conditions (tri- 
phenylphosphine/diethylazodicarboxylate in tetrahydrofuran 
at ambient temperature) to the o-nitrobenzyl linker, which was 
then coupled to the beads by carbodiimide coupling [di(iso- 
propylcarbodiimide)/hydroxybenzotriazole in dimethylform- 
amide]. These intermediates were cyclized on the resin with a 
set of seven ketones, four of which contained a protected 
amine functionality. Modification of the amine functionality 
(when present) with 31 different headgroups under standard 
conditions produced a library of ketones containing 381 
members. The ketones were then reduced with NaBFU, con- 
verted into the corresponding dithiolanes with ethane-1,2- 
dithiol, or left unaltered to produce a library of 1143 distinct 
compounds. 

Synthesis of Acylpiperidine Library. Library elements were 
coupled as either a carboxylic ester or a carbonate ester of an 
o-nitrobenzyl alcohol. First, five piperidines (2-hydroxy- 
methyl-, 3-hydroxymethyl-, 4-hydroxy-, 4-carboxylic-, and 
3-carboxylic-), and two linear moieties (5-amino-3-oxopenta- 
nol and 6-aminohexanoic acid) were attached to this linker 
either by phosgene coupling in solution (for the alcohols) or 
directly to the resin (for the acids). Next, 31 amino acids, 
including 28 examples of both D and L isomers of the 20 
commonly occurring a-amino acids, and 3 nonproteinogenic 
amino acids (piperidine-4-carboxylic acid, piperidine-3- 
carboxylic acid, and 6-aminohexanoic acid) were reacted with 
the free amine. Finally, 31 reagents were linked as carboxam- 
ides, carbamates, sulfonamides, and ureas, including three 
primary sulfonamides (4-sulfamoylbenzamido, 4-chloro-5- 
sulfamoylbenzamido, and 2,4-dichloro-5-sulfamoylbenz- 
amido). 

Library Analysis for Carbonic Anhydrase Inhibition. Beads 
were distributed into 96-well microtiter plates. Compounds were 
detached from the beads by UV irradiation (350 nm) and then 
transferred to assay plates. An assay solution (50 id) consisting of 
0.1 M phosphate buffer (pH 7.4) containing bovine carbonic 
anhydrase (03 fiM) and dansylamide (DNSA; 0.6 jiM) was added 
to each well Fluorescence values 214 nm; Ae m , 454 nm) were 
measured with a Perkin-Elmer model LS 50B spectrof luorimeter 
equipped with a microtiter plate reader accessory and were 
normalized (uninhibited, 100; empty, 0). Assay solutions from the 
initial screen identified as active were transferred to a second 
fluorescence plate and 5 of 2 mM DNSA in dimethyl sulfoxide 
was added to each well to increase [DNSA] ^600-fold in situ. 
Under these conditions, chlorothiazide (IQ — 75 nM) was dis- 
placed, while acetazolamide (K^ «• 7.5 nM) was not, suggesting 
that more potent inhibitors were less sensitive to displacement by 
elevated [DNSA]. 

Selectivity Analysis. Aliquots (one-third) from the same 
bead eluate were assayed against each isozyme. Concentra- 
tions of the two isozymes were matched using ejgo values from 
the literature (21), and the [DNSA] was varied according to the 
literature values for IQ (DNSA) of the isozymes (22). 

RESULTS AND DISCUSSION 

Two model libraries for drug discovery — a 1143-member 
dihydrobenzopyran library and a-6727-member acylpiperidine 
library— were designed. The first (dihydrobenzopyran) library 
(Fig. I A) was designed to include 4-sulfamoylbenzoyl amides 
that were expected to confer carbonic anhydrase activity. The 
second (acylpiperidine) library (Fig. IB) was designed around 
a core aminocarboxamide derived from 31 amino acids, in- 
cluding nonproteinogenic amino acids and both the D and L 
configurations of several naturally occurring a-amino acids. 
The carboxamide was formed from the carboxylic acid with 



seven amines including four substituted piperidines. Thi - 
different sulfamoylbenzoyl groups were incorporated at the 
final step to confer carbonic anhydrase activity. 

Activity of library members against bovine carbonic anhy- 
drase was assessed by using a fluorescence-based ligand- 
displacement assay (9). Because each synthetic library con- 
tained >1000 beads per member, we characterized the activity 
of the library statistically to optimize our screening conditions. 
The dihydrobenzopyran library was determined to contain 
^1.4% actives, identified as those wells where the fluorescence 
decreased by >5<r. Similarly, in the acylpiperidine library, w e 
determined that the samples contained « a 6% actives. These 
observations were consistent with the percentage of primary 
sulfonamides incorporated in the two libraries,, which was 
predicted to be 3% and 10% for the dihydrobenzopyran and 
acylpiperidine libraries, respectively. 

Over 2300 beads from the dihydrobenzopyran library were 
then assayed singly and 33 individual beads were chosen for 
decoding. Thus, 2.0 library equivalents were assayed, where 
one library equivalent is defined as the number of beads equal 
to the number of distinct library members. The probability that 
a given compound was not assayed is approximated as follows: 
In a library of AT total beads » M> the total number of library 
members, when S beads have been assayed, the probability that 
a given member has not been chosen is given by 



P s (0) ■ 



[1] 



which is approximately (e~ l ) s/M for large S. The quantity 5/:V/ 
isL, the number" of library equivalents that have been assayed, 
so P s (0) e~ L . For the case at hand, the fraction of the library 
assayed at least once is (1 - e~ 2 ) or 86%. 

The larger acylpiperidine library was pooled at the final 
reaction step into two smaller portions to permit a more 
convergent assay strategy. These pools consisted of 3472 and 
3255 members, the larger portion containing all the primary 
sulfonamides. For the smaller portion, *»5 library equivalents 
(^17,000 beads) were assayed at 10 beads per well, and no 
actives were identified. Thus, in Eq. 1,L «• 5 andP 5 (0) is <0.01. 
Hence, >99% of the members were assayed at least once, and 
none were found to be active under the conditions described. 

We therefore focused our continued efforts on the portion 
of the acylpiperidine library that contained primary sulfon- 
amides. Here, half the eluates from a total of 4320 single beads 
(1.3 library equivalents) were assayed individually, and >300 
actives were identified (Fig. 2A). To estimate the relative 
potency of the active inhibitors, the stringency of the assay was 
increased by raising the concentration of the competitor, 
DNSA However, the associated background fluorescence 
made the fluorescence decrease caused by inhibitors less easily 
detected. Potent actives were therefore judged as differing by 
>3cr compared to the median active compound. Eighteen 
high-affinity actives were thereby identified and decoded (Fig. 
IB). 

As anticipated, we found that carbonic anhydrase inhibitors 
from either library exclusively contained the sulfamoyl group 
(-SO2NH2). From the actives, we chose two synthetic targets, 
one from the dihydrobenzopyran library (compound 1) and 
one of the more potent actives from the acylpiperidine library 
(compound 2). These compounds were resynthesized, their 
structures were confirmed spectroscopically, and their K4 
values vs. carbonic anhydrase were determined experimentally 
(Table 1). The low nanomolar values for compounds 1 and 2 
support the use of tag-derived information for structure- 
activity assessments in the libraries we report. 

Significantly, no structures containing a 4-chIoro-5-sulfa- 
moylbenzoyl moiety were found among the tag-predicted 
structures for potent actives in the acylpiperidine library. 
Furthermore, while the other two sulfamoylbenzamides v.vrc 
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well-represented, they showed markedly different structure- 
activity relationships. For example, the 4-sulfamoylbenz- 
amides appear to prefer a hydrophobic R2 group of the L 
configuration, while the 2,4-dichloro-5-sutfamoylbeiizamides 
appear to tolerate more diversity at R2, including p-alanine. 
Thus, to test the validity of this observation, library-specific 
homologs of the acylpiperidine (compound 2) were synthe- 
sized (Table 1, compounds 3 and 4). The 150-fold increase in 
Kd seen upon substitution further validates the ability of the 
screen to select active compounds and supports the divergent 
structure-activity relationships observed with the various sul- 
fonamide headgroups. 

In developing a class of compounds for use as therapeutic 
agents, increased affinity is but one criterion for optimization. 
Isozyme selectivity is often an additional critical property 
required to minimize side effects. The selectivity of a large 
library is difficult to optimize convergently (i.e., with >1 
element per assay), because such assays must measure the 



Fig. 1. Synthesis and tagging of an encoded com- 
binatorial library. Library members were synthesized in 
parallel on polymeric beads by a split synthesis protocol 
(17-19). Polyethylene glycol-graf ted polystyrene beads 
(TentaGel S-NH 2 ; Rapp Polymere, Tubingen, Ger- 
many) were modified to include a photocleavable o- 
nitrobenzyl linker. General structures and examples of 
structures are illustrated for the dihydrobenzopyran 
library (A) and the acylpiperidine library (£). An out- 
line of the binary tagging method is also shown (C). 
TFA, trifluoroacetate. 

absence of activity against a particular target or relative levels 
of activity vs. multiple targets. As the percentage of actives 
increases, the probability that the observed affinities are the 
property of multiple elements increases. Thus, approaches 
taken to identify high-affinity peptides in a convergent fashion 
(e.g., see ref. 6) become impractical with selectivity, and 
focused sublibraries assayed singly become pertinent. To eval- 
uate the potential for using encoded combinatorial libraries to 
optimize selectivity, a 217-member sublibrary (consisting of 
the 4-sulfamoylbenzamides from the larger acylpiperidine 
library) was prepared and evaluated against human carbonic 
anhydrase isozymes I and II [CA(I) and CA(II)]. 

To isolate selectivity as a variable, two factors were nor- 
malized. First, equal inhibitor concentrations were exposed to 
each isozyme by taking aliquots from the same bead eluate. 
Thus, onetthird of a complete eluate was assayed vs. human 
CA(I), and one-third was assayed vs. CA(II). Second, the 
concentrations of the two isozymes were matched, but the 
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Fig. 3 A shows a selectivity plot comparing the normalized 
fluorescence values for the two enzymes. In this figure, region 
A contains compounds that show potency vs. both isozymes, 
while region B contains compounds that are less selective for 
CA(II) than the median, and region C contains compounds 
that are more selective than the median. Sulfonamides that are 
not particularly active against either isozyme are found in 
region D. Beads from each region were decoded, and examples 
of the structures predicted by these codes are shown in Fig. 2B. 
The potent compound that was identified in screening against 
bovine carbonic anhydrase (Table 1, compound 2) was iden- 



Normalized F 

Fig. 2. Histogram analysis of combinatorial libraries. (A) Low- 
stringency library analysis. Half the eluates from a total of 4320 single 
beads (12 library equivalents) were assayed individually, and >300 
actives (5c) were identified. For the dihydrobenzopyran library, 
>2300 members (2.0 library equivalents) were assayed singly and 33 
individual beads were identified and decoded. (B) Reanalysis of 
positives from y4 at higher stringency. To increase the stringency of the 
assay, the concentration of DNS A was increased »* 600-fold in situ, and 
18 potent actives were selected for decoding. 



[DNSA] was varied according to the literature values for 
^dnsa of the isozymes. These requirements derive from the 
binding equation, given relative to isozyme x 



200 



F x * [Er-DNSA] = 



[E] t( 



1 + 



^DNSAjc 

[DNSA]j 



[2] 



Thus, when Et 0 t is equal, the ratio [DNSA]/K D nsa is equal, and 
[I] is equal, the only variable that can affect F is the dissociation 
constant for the inhibitor, K\. To identify samples for which 
there is a significant difference in K x for the two isozymes, the 
normalized fluorescence values were analyzed graphically 
(Fig. 3). 

Table 1. Analysis of compounds and validation of tag-derived 
structure-activity relationships 
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Structures of compounds l^t were confirmed analytically by ] H 
NMR spectroscopy and mass spectrometry. Ka values were measured 
as described (7) with excel software to solve the multiple equilibria 
numerically, b, Bovine. 
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Fig. 3. Selectivity assay of a highly active sublibrary. (A) Inhibitors 
were analyzed independently for each isozyme as in Fig. 2 and then 
plotted. The observation that the majority of the library members are 
found above the diagonal suggests that the selectivity for hCA(II) may 
be a property of this class of compounds rather than a characteristic 
of a given member. (B) From tag-derived relationships, there are clear 
trends associated with the different regions, and some examples of 
compounds are shown. The characteristics of the potent inhibitors 
(region A) are described above, and the potent compound that was 
identified in assays against bovine carbonic anhydrase (Table 1. 
compound 2) was found here. Region B (relatively nonselective) shows 
a preference for cyclic imino acids in the central region, which may 
reflect differential hydrogen bonding by the carboxamido nitrogen of 
the sulfamoylbenzamide (13) or to a positive discrimination by the ring. 
Region C [relatively hCA(II) selective] shows a preponderance of 
polar side chains, as well as a relative lack of sensitivity toward the C° 
configuration. Region D (low activity) contains more polar com- 
pounds but, notably, one D-leucyl side chain that complements the 
strong preference for hydrophobic groups of the l stereoconfiguriiii° n 
noted above. 
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tified again in region A. Analysis of the selectivity of this 
compound showed that it has a 3-fold selectivity for human (h) 
CA(II) {tfdlhCA(II)] = 11 nM;/C d [hCA(I)] = 33 nM}, but this 
* would not be apparent from the plot because of its 
ively high affinity. Since the other compounds in this small 
sublibrary are also analogs of 4-sulfamoylbenzamide, the ob- 
servation that most library members are found above the 
diagonal suggests that the selectivity observed for compound 
2 may be a property of this class of compounds rather than a 
characteristic of this particular member. Nevertheless, there 
are clear trends associated with the different regions. Region 
B (relatively nonselective) shows a preference for cyclic imino 
acids in the central region, which may reflect differential 
hydrogen bonding by the carboxamido nitrogen of the sulfa- 
moylbenzamide (13), or to a positive discrimination by the ring. 
Region C [relatively hCA(II) selective] shows a preponderance 
of polar side chains as well as a relative lack of sensitivity 
toward the C° configuration. Region D (low activity) contains 
more polar compounds but, notably, one D-leucyl side chain 
that complements the strong preference for hydrophobic 
groups of the L stereoconfiguration noted above. Together, the 
trends derived from library analysis suggest that more selective 
compounds could be synthesized, either as single compounds 
or as appropriately designed focused libraries. 

Conclusions. We have synthesized encoded libraries con- 
taining 10M0 4 small molecules by using diverse chemistries. 
Active carbonic anhydrase inhibitors within these libraries 
have been identified and optimized by using directed subli- 
braries based on the original lead and the chemistry developed 
for its synthesis. This application of combinatorial libraries for 
drug discovery addresses two pressing needs of the pharma- 
ceutical industry. First, small-molecule leads will become more 
readily available as the size and diversity of synthetic libraries 
ceases. This, in turn, will accelerate the discovery of com- 
unds active against a variety of therapeutic targets, espe- 
cially those that emerge from the explosion in sequence 
information arising from the human genome project. Second, 
synthetic chemists involved in drug discovery will be more 
efficiently engaged to solve sophisticated structure-activity 
problems. The success of the combinatorial approach empha- 
sizes the power of techniques for the rapid synthesis and 
evaluation of large numbers of possible structures and antic- 
ipates a new age of drug discovery and optimization. In the 
future, analysis of combinatorial libraries may be used to direct 
research efforts toward productive avenues by suggesting the 
synthesis either of a handful of pertinent test compounds or of 
combinatorial libraries based on relevant structural types. 
Indeed, such a combinatorial approach may be productively 
applied not just in pharmaceutical discovery, but in any area 
where optimization of molecular properties is necessary. 
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